CDS

Accession Number TCMCG075C26681
gbkey CDS
Protein Id XP_017983299.1
Location complement(join(9602225..9602557,9602699..9602814,9602914..9602989,9603079..9603261,9604082..9604159,9604251..9604331,9604454..9604519,9604612..9604677,9604751..9604801,9604927..9605010,9606876..9606976,9607062..9607203,9607767..9607864,9608009..9608078,9608158..9608250))
Gene LOC18589060
GeneID 18589060
Organism Theobroma cacao

Protein

Length 545aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018127810.1
Definition PREDICTED: putative clathrin assembly protein At5g35200 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category TU
Description Clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K20043        [VIEW IN KEGG]
ko:K20044        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCAAGCACTCAGCATAGCTTAAGAAAAGCAATTGGGGCTCTGAAGGACTCCACAAAAGTTGGATTGGCAAAAGTAAATGGTGATAACAAGGGGCTAGATGTTGCAATTGTTAAGGCCACAAACCATAAAGAAAAAGTGCCAAAGGAGAAACATGTCAGGACTATCTTGATTGCAGTTTCAGCTTCAAGCCCTCGCTCTGATGTTGCTTTCTGCGTACACTCTCTTTTTAAGCGTCTTGCTAAGACACATACTTGGACGGTTGCATTGAAAACCTTGATCGTTGTACACCGTGCACTAAGGGAAGTTGATCCTTCATTTCACCAAGAGCTAATTACTCTTGGACGGGGCAGAGGTCTCATGCTGAACCTAGCACATTTCAGAGATGAATCAAGTTCACAAGCATGGGACTACTCTGCCTGGATTCGTAGATATGCTTTGTATCTCGAAGAGCGTCTAGAATGCTTCCACGAATTGAAATATGATGTTGATAAAGATCAGTCGAGAAACGGAAGGCTTGACACCCCGCATCTGATACGGCAGCTACCTGTCTTGCAAGAGCTTCTCCATCGCCTTCTTGCTTGCAAGCCAGAGGGGGCAGCTTTATGCAACCGCTTGATTCATTATGTTCTATCAATTGTTGCAGGCGAATGTGTTAACCTATACATTGCAATTACTGAGGGAATTTTAAATCTGGTTGACAAGTATTTCGAGATGCAGCACCACCATGCTGTTAAGGCACTTGAGATTTATCGGAAGGCAGGAAATCAGGCCTCACAGTTATCTGAGTTCTTTGAGATATGCAAGGGCCTTCACTATGGGCAAGGGCAAAAGTACCTTAAGATTAAACCGCTCCCTGCATCATTCCTGACTGCTATGGAAGATTACGTGAAGGAGGCTCCAGAAGTTTTGACGCTTCCATATAAAGCAATAAAGGATGATAATAAAGGTGCTGCTCCCACAGAAGTCCCTACTCCTAGATCTGATTTGTTAATAGATCATAACCAAGACACTGATGTTCAAGAAAAATCAAGCCCCTCTGTTACACCTTCGGACCAACCCCAGAGTGATCCAAGGCAGGGTGTTGCAAAGCTAGAGATTGCTGATCTTCTGTGCTTTGACGACCCACCTGAGGAAGGATCTGAACTGAATGACAAAAATTCCCTTGCTCTAGCAATTGTTGAATCTGAAGGTGTTTCAAGTGCTGGAAATGATGTCAGCTCAGCATCCGCAACTCCAAGTTGGGAGCTTGAACTTTTTAGTGCACCAAGCTCAAATGGAGCAGCTCTTGCAGAGAATAATGTGACTGGGAGATTGGACAGATTAACACTAGACAGTTTATACGATCAAGCAATAGCGAGTACTACACATCAGGATCGGGCATGCAACTTGGGTCAGGTGTCCACGAACCCTTTTGAGGTTGATTACGACCAGGATCCAATTTGTGGAAGCAGCGATGTCACACCTCCAACTGATGTGCAAATGGAAAGCATGGCTCAACAGCAAACTTACATCATGCAGCAGCAGCAGCAGCCTCCCATGGTTGGCTACGATTCAACAATCCCTTCCGGTAATCCCTTTGTCGAGCACAGCATGCCATCTCAGCCACCTGAGAATTCCTATTCTGGCTTAATTTAG
Protein:  
MASTQHSLRKAIGALKDSTKVGLAKVNGDNKGLDVAIVKATNHKEKVPKEKHVRTILIAVSASSPRSDVAFCVHSLFKRLAKTHTWTVALKTLIVVHRALREVDPSFHQELITLGRGRGLMLNLAHFRDESSSQAWDYSAWIRRYALYLEERLECFHELKYDVDKDQSRNGRLDTPHLIRQLPVLQELLHRLLACKPEGAALCNRLIHYVLSIVAGECVNLYIAITEGILNLVDKYFEMQHHHAVKALEIYRKAGNQASQLSEFFEICKGLHYGQGQKYLKIKPLPASFLTAMEDYVKEAPEVLTLPYKAIKDDNKGAAPTEVPTPRSDLLIDHNQDTDVQEKSSPSVTPSDQPQSDPRQGVAKLEIADLLCFDDPPEEGSELNDKNSLALAIVESEGVSSAGNDVSSASATPSWELELFSAPSSNGAALAENNVTGRLDRLTLDSLYDQAIASTTHQDRACNLGQVSTNPFEVDYDQDPICGSSDVTPPTDVQMESMAQQQTYIMQQQQQPPMVGYDSTIPSGNPFVEHSMPSQPPENSYSGLI